Algorithm for Explaining Credit Scores 

RELATION TO OTHER PATENT APPLICATIONS 

5 

This document derives priority from provisional U.S. patent application serial 
no. 60/222,231, filed 8/1/00; and is related to U.S. patent application serial no. 
09/790,453, filed 2/22/01, which in turn claims benefit of provisional U.S. 
patent application serial no. 60/222,205, filed 8/1/00. 

10 

BACKGROUND OF THE INVENTION 
TECHNICAL FIELD 

15 The invention relates to credit scoring. More particularly, the invention relates 
to an algorithm for explaining credit scores. 

DESCRIPTION OF THE PRIOR ART 

20 Recent events have made it desirable for developers of credit scoring 
algorithms, such as Fair, Isaac and Company, Inc. of San Rafael, California 
(FICO) to move toward offering a service to deliver credit bureau risk scores 
and explanations directly to consumers and lenders. Consumer advocacy 
groups and credit counseling organizations have provided positive feedback 

25 on these announced intentions. Additionally, credit scoring developers 
clients, i.e. the credit grantors themselves, have expressed their 
understanding of the need to pursue this undertaking. Most organizations 
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are comfortable that each credit scoring developer, such as Fair, Isaac, is the 
only entity in the market that can actively take on the role of credit score 
delivery and explanation. 

5 A comprehensive score delivery and explanation service should include all of 
the following pieces: 

1 . Credit scores delivered to consumers. 
10 2. The primary reason codes that describe why the score was not higher. 

3. The consumer's credit bureau report from which the score was calculated 
to allow them to cross-reference the information with his/her actual credit 
report. 

15 

4. A personalized score explanation that describes to that consumer, in plain 
language, how their individual score was derived. This explanation service 
can be further enhanced using data elements present in the consumer's 
credit report. 

20 

Given the desirability of providing such information to consumers, it would be 
advantageous to provide a method and apparatus for explaining credit scores. 

A. Flint, D. Lear, C. St. John, Method and Apparatus for Explaining Credit 
25 Scores, U.S. patent application serial no. 09/790,453 (2/22/01) describe a 
Web site containing an array of informative resources including for-pay 
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services and extranet functions to serve consumers and traditional players in 
the financial services industry, including financial counselors, mortgage 
brokers, direct lenders, large national credit issuers, and third-party credit 
report re-sellers, plus information seekers such as the press, consumer 
5 groups, and government agencies. A primary focus of the Flint et al. 
invention is to educate consumers, consumer groups, and the consumer 
press by offering them access to the exceptionally high-quality information, 
both general and personal, about the practices of collection, storing, reporting, 
and evaluating consumer credit data. 

10 

It would be advantageous to provide an algorithm for explaining credit scores, 
for example in connection with a credit score explanation service. 

SUMMARY OF THE INVENTION 

15 

An exemplary Web-based score explanation service typically requires only 
the credit bureau identifier, credit score, and up to four reason codes as input. 
The invention herein discloses an algorithm that is used to provide an 
explanation of the primary factors influencing the score based upon a rich 
20 data feed. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a block schematic diagram showing targeted users, access and entry 
25 points, and services provided by myFICO.com; and 
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Fig. 2 is a flow diagram showing an algorithm for explaining credit scores 
according to the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

5 

The presently preferred embodiment of the herein described algorithm for 
explaining credit scores is provided for use in conjunction with a credit score 
explanation service that may be implemented in any of several embodiments, 
The preferred embodiment of the invention operates in conjunction with a 

10 Web site containing an array of informative resources including for-pay 
services and extranet functions to serve consumers and traditional players in 
the financial services industry, including financial counselors, mortgage 
brokers, direct lenders, large national credit issuers, and third-party credit 
report re-sellers, plus information seekers such as the press, consumer 

15 groups, and government agencies. A primary focus of such Web site is to 
educate consumers, consumer groups, and the consumer press by offering 
them access to the exceptionally high-quality information, both general and 
personal, about the practices of collection, storing, reporting, and evaluating 
consumer credit data. 

20 

The working title of the Fair, Isaac and Company (FICO) Web presence is 
myFICO (myfico.com) because the most visible elements of the service are 
aimed directly at consumers who want to learn about their own FICO score. 

25 Although the on-demand receipt of FICO scores is thought to be the primary 
draw to the site (based on consumer interest and press coverage), the 
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invention also offers access to additional valuable services, such as 
registration in an opt-in/opt-out database, the ability to initiate requests for 
credit investigations, the ability to link to consumer credit counseling services 
should scores be low and represent high risk, and the ability to access 
5 multiple reports from different repositories upon request. These services 
heighten the level of consumer education, and also offer individuals access to 
information, actions, and preferences they have not had previously. 

An additional benefit is to use myFICO.com to supply the consumer with their 
10 score and if that score is sufficient to pass the cutoff scores of specific brokers 
or lenders, the credit scoring developer can pass the consumer's name, 
application, and credit score on to the lender for consideration. The invention 
allows the credit scoring developer to build broker networks to refer these 
applicants to lenders who would approve them. The credit scoring developer 
15 can also link the applicants' email address to credit companies who wish to 
pre-approve and solicit these consumers based on score. This is a much 
more cost effective origination process (via email) than direct mail today. 

Fig. 1 is a block schematic diagram showing targeted users, access and entry 
20 points, and services provided by myFICO.com 10. Access by consumers 12 
may be through a credit reporting agency 13, , using an identification 
verification process to access credit score reports 17, for opt-in/opt-out 
requests 16, to access a report service 18, and to initiated on line 
investigations 19; through a secure, one time connection 15 for one-off credit 
25 score reports 20; or through an entirely anonymous access method 14 (the 
latter also allows access by government agencies 22) for consumer oriented 



5 



information 21. myFICO.com also provides an extranet logon facility 25 to 
credit score reports 37 for such users as financial counselors 24, mortgage 
brokers 26, and direct lenders 27; an automated application service provider 
entry 29 to credit score reports 30 and other reports 31 for such users as 

5 large credit issuers 28, on line financial service providers 32, and credit report 
resellers 33; and repository access 35 to credit score reports and other 
reports 36 for repository consumer representatives 34 and credit report 
resellers 33. See A. Flint, D. Lear, C. St. John, Method and Apparatus for 
Explaining Credit Scores, U.S. patent application serial no. 09/790,453 

10 (2/22/01). 

Score Explanation Service 

An exemplary Web-based score explanation service requires, for example, 
15 only the credit bureau identifier, credit score, and up to four reason codes as 
input. In contrast, the invention herein discloses an algorithm that is used to 
provide an explanation of the primary factors influencing the score based 
upon a rich data feed. This algorithm can be enhanced depending upon the 
amount of input data available, although the use of an enhanced algorithm is 
20 optional and not considered to be a key element of the subject invention. The 
actual explanations may be selected as appropriate for the application to 
which the invention is put. Typical explanations are those described in A. 
Flint, D. Lear, C. St. John, Method and Apparatus for Explaining Credit 
Scores, U.S. patent application serial no. 09/790,453 (2/22/01). 

25 

Fig. 2 is a flow diagram showing an algorithm for explaining credit scores 
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according to the invention. The invention comprises a score explanation 
algorithm, which could be applied to any score. The presently preferred 
embodiment of the invention provides the basis for a general web-based 
score explanation service (see Fig. 1). 

5 

Score Explainer 

Consider a score, which can be written as some function of a set of prediction 
characteristics (100) 

10 

Score = <;0Ci / X 2 '-/X c )/ 
where 

Xj = Prediction Characteristic j . 

For a credit bureau risk score, there are about 80 prediction characteristics, 
which can include, for example, such characteristics as the number of trade 

15 lines with a current delinquency, although the actual number chosen when 
practicing the herein disclosed invention can vary depending upon the 
particular application to which the invention is put. If the score is a single 
scorecard, i.e. a scoring model where the score for an individual is the sum of 
their characteristic scores, and where the number of terms in this sum is the 

20 number of characteristics, so that, for each characteristic, the individual is 
assigned a score weight and then their final score is the sum these score 
weights, then it can be written in the form 
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c 

Score = qj(Xj). 



If the score is a segmented scorecard, i.e. where the population is segmented 
5 into mutually exclusive segments, and where a separate scorecard model is 
developed for each segment, then the formula is more complicated. For 
example the score weight associated with a particular value of a particular 
characteristic depends on what segment the individual is in. In such cases, 
the formula can be written down and analyzed. In fact, the invention disclosed 
10 herein works for any score, which can then be computed in a reasonably fast 
manner by application of the invention thereto. 

To explain a score in detail, we define a set of surrogate characteristics, z u z 2 , 
z p , which are labeled "Areas for improvement" (110). Areas for 
15 improvement include, for example, such factors as too much delinquency, too 
much debt, short credit history, etc. There are p Areas for Improvement, each 
area represented mathematically by a surrogate characteristic. Typically, p < 
c. For example, see below. 

20 Use these Areas for Improvement surrogate characteristics to develop a 
surrogate score of the form 
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\\r( Zl ,z 2 ,...,z p ), 



which is developed using z u z 2 , z p as the prediction characteristics and 

as the performance (dependent) variable (120). Now consider the customer, 
5 who wants their score explained. Their actual score is 

Score* =<;(Xi, X2 — Xc) 



and their values of z 1? z 2 , z p are 

9 -^2 ' * * * ' ' 

10 A rich data feed with the values of the z's is needed, but not necessarily the 
values of the x's. 

Associated with each Area for Improvement, define the potential improvement 
metric (130) 



max{Y(Zi(z k ),...,z k ,...,zl{z k ))-Y(Zi,...zl,...,z* p )} 

/, =100X^ ; 

Score 



Maximum possible percent improvement for 
Area of Improvement k, 



where, e.g. z* 1 {z k )=z\ unless z\ cannot coexist with z k . In that case, 
z\ (z k )= e\z 7 | z k ] , or some other value of z 1 that can coexist with z k . 

5 Suppose that the values of the I's are ordered as follows: 
I 4 >I 7 >I 2 >/ n >... 

Then one could optionally create an ordered Area for Improvement table of 
the form shown in Table "A" below (140). 
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Table A: Areas for Improvement 



Areas for Improvement 


Potential Percent Improvement in Score 


4 


l 4 


7 


l 7 


2 


l ? 


11 


In 











The number of Areas for Improvement listed would be small relative to the 
15 value of p. 

Definitions of the Areas for Improvement 
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In the simple case, one could use the original prediction characteristics as the 
Areas for Improvement (152). However, in cases such as the credit bureau 
risk score, this would yield too many Areas for Improvement. 

5 Another possibility is to use the set of prediction characteristics, which are 
computed, for example, as part of Fair Isaac's Search™ Software product as 
the Areas for Improvement (154). When Search™ goes to the credit bureau 
to get information on a person, these Search™ prediction characteristics are 
returned. The Search™ Software product improves origination decision 

10 making by automatically obtaining credit bureau reports and scores from the 
major North American consumer credit bureaus, as well as providing 
comprehensive and sophisticated analysis (see 
http://www.fairisaac.com/index.html) In Search™ there are about 40 
characteristics. 

15 

Another approach is to define a set of surrogate characteristics from some 
standard categorization of the credit bureau characteristics that go into scores 
(150). The categories listed on a Fair Isaac Web page (see 
http://www.myfico.com/filfsf.html) are (i) Payment history, (ii) Amounts Owed, 
20 (iii) Length of credit history, (iv) New credit, and (v) Types of credit use. 

Suppose that the 80 original characteristics are categorized into the five 
categories above as follows: 

Category 1: x^ x 2 , x 15 
25 Category 2: x 16 , x 2 , x 35 

Category 3: x 36 , x 2 , x 50 

1 1 



Category 4: x 51l x 2 , x 65 
Category 5: x 66 , x 2 , x 80 



Then one could develop a surrogate characteristic of the form 

5 

where 

§\{ x \/ X 2'-~' X lb) 

10 

is a score developed using x 1f x 2 , x 15 as the prediction characteristics and 
the real credit score as the performance (dependent) variable (160). The 
variables z 2 , z 3 , z 5 could be developed in a similar way. 

15 This creates five possible Areas for Improvement (170), which might be too 
small. Each of the above categories could be broken into several sub- 
categories to create more Areas for Improvement. 

The three approaches described above are shown on Fig. 2 as alternatives. 

20 

Discussion 
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Reason Codes 



The following discussion of the invention is a generalization of the technique 
used today for computing reason codes. Associated with a person's score is a 
set of about four reason codes, which can be easily returned from a credit 
bureau along with the score value. In the background, associated with each 
reason code, is a score difference. These score differences are defined 
slightly differently than the score differences in the formula for \ k but they 
could be converted to percentages in the same way that the score differences 
are converted to percentages. This idea is readily implemented with a data 
feed, which includes the score differences associated with the reason codes. 

Segment Split Variables 

The above formula for l k takes into account automatically and precisely the 
split variables used to define the segments in a segmented score. The 
maximization over z k for a split variable involves the computation of the 
customer's score in several segments, because, as z k varies over its range, 
one moves from segment to segment. 

Rich Data Feed 

The presently preferred embodiment of the herein disclosed invention 
requires a rich data feed, i.e. the values of the z*'s. However, the values of the 
x*'s are not needed, unless they are the same as the z*'s. 
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Number of Areas for Improvement 



In the preferred embodiment, the total number of Areas for Improvement 
5 could be just about any number between five and 80. However, showing the 
customer the typical top four is presently preferred. The total number is 
preferably significantly more than four, i.e. five is too small. Twenty is 
presently preferred. This means that the five categories mentioned above in 
connection with the Fair Isaac site should be expanded to about twenty. The 
10 original 80 prediction characteristics should be put into about twenty 
categories. 

Generality 

15 The algorithm described herein would work for any score, which can then be 
computed in a reasonably fast manner by application of the invention herein 
thereto. This includes neural networks and regression trees. 

Although the invention is described herein with reference to the preferred 
20 embodiment, one skilled in the art will readily appreciate that other 
applications may be substituted for those set forth herein without departing 
from the spirit and scope of the present invention. Accordingly, the invention 
should only be limited by the Claims included below. 



